Influence of reciprocal arcs on the degree distribution and degree correlations 
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Reciprocal arcs represent the lowest order cycle possible to find in directed graphs without self- 
loops. Representing also a measure of feed-back between vertices, it is interesting to understand how 
reciprocal arcs influence other properties of complex networks. In this paper we focus on influence of 
reciprocal arcs on vertex degree distribution and degree correlations. We show that there is a funda- 
mental difference between properties observed on the static network compared to the properties of 
networks which are obtained by simple evolution mechanism driven by reciprocity. We also present 
a way to statistically infer the portion of reciprocal arcs which can be explained as a consequence of 
feed-back process on the static network. In the rest of the paper the influence of reciprocal arcs on 
a model of growing network is also presented. It is shown that our model of growing network nicely 
interpolates between BA model for undirected and the BA model for directed networks. 

PACS numbers: 89.75.Hc, 89.75.Fb, 05.65. +b, 05.10.Gg 



I. INTRODUCTION 

Most of real networks as for example Internet 
WWW 0] or biological webs etc. show interesting 
topological properties compared with simple models of 
random graphs. Today it is well known that most of 
the real networks exhibit certain properties such as fat 
tail degree distributions or small- world effect etc. 0, Q. 
Reading literature on different types of networks a reader 
will find a huge number of papers describing different 
types of correlations in cornplex networks. Assortativ- 
ity 01, clustering coefficient reciprocity [J, k-cores 0, 
rich-club coefficient Triad Significance Profile [ll| 
and many other measures related to correlations are fre- 
quently reported. Although the identification of these 
correlations and their reporting in various empirical com- 
plex networks has significantly improved our understand- 
ing of the field, the question of interrelations of correla- 
tion measures naturally emerges. Is it really surprising to 
find, for example, both the strong rich-club behavior and 
strong degree correlations in the network? The answer is 
~ clearly not. Today, it is a well known fact that most of 
the real networks are correlated. Nevertheless, there is 
still a huge gap in our understanding of how exactly cer- 
tain types of correlation-related measures influence other 
correlation-related measures. In this paper we will try to 
bridge a part of that gap relating the reciprocity measure 
to degree sequence and to degree correlations. 

The directed network represents an interesteng sub- 
group of real networks which allow movement in just one 
direction. Reciprocity Q of complex networks is frac- 
tion of arcs which have their counterparts showing in the 
opposite direction compared to the total number of arcs 
i.e. every bidirectional arrow in the directed graph is 
considered as composed of two reciprocal arcs. It can be 
said that it is in fact a measure of how much is directed 
network similar to undirected one. Reciprocity was also 
shown as an important feature for percolation on directed 
networks [T2| . In previous work we have also shown that 



the reciprocity is a very stable correlation measure of all 
the investigated measures in the case of Wikipedia net- 
works ensemble 



In the paper [3| the influence of the broad class of de- 
gree correlations on the reciprocity measure is described 
and quantified. In this paper our aim is exactly the oppo- 
site i.e. to find a way to quantify the influence of recipro- 
cal arcs on degree correlations and degree distributions of 
complex networks. More precisely, first we focus on ran- 
dom addition of reciprocal arcs in the underlying static 
network. This process of transformation of unidirectional 
arcs into the bidirectional ones can be justified in many 
ways. First, it is the simplest possible choice of creating 
reciprocal links in the network which already has some 
structure. Second, it is the logical model of information 
return in the case of the information networks like an 
e-mail or WWW network. 



II. INFLUENCE OF RECIPROCITY ON 
DEGREE CORRELATIONS IN STATIC 
NETWORKS 

As a null hypothesis it is reasonable to suppose that 
the mutual functional relationships between vertices are 
distributed completely randomly over the whole network. 
In other words, that means that we suppose that the mu- 
tual functional relationship does not depend in any way 
on the degrees of vertices or any other measurable net- 
work quantity. We suppose that reciprocal arcs can form 
only between vertices which are already connected. The 
question we address is the following: how reciprocal arcs, 
formed in this way, transform the degree distribution and 
correlations between degrees in a given complex network? 
The model is defined with the initial network as an in- 
put. On the starting network the unidirectional arcs are 
transformed with probability p into reciprocal ones, and 
with probability I — p are left unchanged. After this 
process the properties of the new network are measured 
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again. We show that using the inversion of the transfor- 
mation process we can infer the most probable starting 
configuration of the network. 

In the following we distinguish between bidirectional 
arc as a single arc which is pointing in two directions 
contrary to some other analyses in which reciprocal arcs 
are represented as two arcs connecting two vertices i and 
j in opposing directions 0, [3l • Every vertex of the net- 
work is described by 3 numbers - the first represents 
exclusive in-degree of the vertex, the second represents 
exclusive out-degree of the vertex, and the third repre- 
sents its bidirectional degree. Exclusive in (or out) degree 
is the number of unidirectional in (or out) arcs which are 
attached to a given vertex. In the following these degrees 
will be designated as A; = (fc^, fco, kr). 

From the initial network we can extract the following 
information as initial conditions for the observed trans- 
formation process: L - number of arcs; N - number of 
vertices; - number of strictly unidirectional arcs; 

- number of strictly bidirectional arcs; L{k ^ q) - num- 
ber of unidirectional arcs which are pointing from the 
vertex of degrees fc to the vertex of degrees q; L{k ^ q) 

- number of bidirectional arcs which are connecting the 
vertices of degrees k and degrees q; N{k) - number of 
vertices with degrees fc. In the following text the conven- 
tion will be that if we observe unidirectional arcs, with fc 
are designated degrees of the starting vertex while with 
q are designated the degrees of the ending vertex. 

With these properties it is possible to represent ade- 
quately maximally random graph, as well as graphs with 
any given degree distribution and correlations between 
degrees of neighboring vertices. Information on correla- 
tions between degrees of neighboring vertices existing in 
the network is given by frequency of arcs which connect 
different vertices: 
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Although the statistics of de- 



V{k,^,q) = 
P{k,^,q) = 



L{k ^ q) 

L 

L{k ^ q) 
L ' 



(1) 



Probabilities V{k, q) and T{k, q) are defined as 
joint probabilities that the vertex of degrees fc is pointing 
to I is connected to the vertex of degrees q, with unidi- 
rectional arc in the former and with the bidirectional arc 
in the later case. The proper summation of this joint 
probabilities is 

oo oo 

^ V{k, ^,q)+ ^(fc' 9) = 1' (2) 



1=10 
k>q 



k— k 
1=10 



where q^^ (0,0,1), fco= (0,1,0) and (1,0,0). 
The summations are different for the bidirectional arcs 
compared with unidirectional arcs because 7'(fc, <->, q) = 



grces of neighboring vertices gives relevant information 
on correlation structure of the given network, and the 
one-vertex statistics can in principle be easily calculated 
from that information, from analytical aspect we will 
show that is much easier to explicitly calculate one vertex 
degree correlations described with 



P(fc) 



A^(fc) 

IT'- 



(3) 



where P{k) represents the joint probability that the ver- 
tex has degrees ki, ko i kr- 

In the studied model every unidirectional arc is trans- 
formed in a bidirectional one with the probability p. The 
equation which expresses a new joint probability that a 
vertex of degrees fc' is pointing to a vertex of degrees q' 
via unidirectional arc is: 

p'(fc', ^, q') = J2 ^(fe'- g): (4) 



where T represents the transition probability for the 
given process. A prime on the probabilities means that 
they are calculated after the transformation process, 
while the absence of a prime means that the probabil- 
ities are calculated from the given starting network. The 
summation is run over the set C of unidirectional arcs 
which fulfill the following conditions: (i) The number of 

neighbors S'^-'^ = fcp-* -I- fco"'-' -I- fc^"'^ is conserved for every 
vertex j, because transformation process does not cre- 
ate new arcs between vertices which are not neighbors 
already; (ii) Before and after the transformation process 

the following relations hold: k'^-'^ < k'f \ k'o''^ < k^'' and 
fcr"'^ > fcr"'\ The transition probability T written in a 
more detail is: 



r(fc',^,g'|fe,^,q) 



{l-p)Tik'^\h)T{k'^-l\ko-l) 
T{ql-l\q,-l)T{q'Jq,). (5) 



The first part of the equation ([5]) is probability that the 
unidirectional arc stays unidirectional after the transfor- 
mation process. Other unidirectional arcs attached to 
the vertices can be changed with probability p or stay 
unidirectional with probability 1 ~ p. The fact that in 
this case only other arcs are monitored is represented in 
equation by subtracting one arc from the out-degree of 
the out vertex and the in-degree of the in vertex. Proba- 
bilities of the transition T(x'\x), where x represents any 
of the aforementioned degrees are binomial probabilities 
i.e. 



Tix'\x)^i^^,jp^-^{l~pr . (6) 

New joint probability distribution of degrees of the ver- 
tices connected via the bidirectional arc V{k, <-^, q) is: 
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c 

+ ^r(q',^,fc'|fc,^,q)P(fc,^,q). (7) 

c 



The set C is fulfilling all the conditions as in equation 
(ID), while for the set C the following relations hold: (i) 

The number of neighbors S^^'^ = k!f^ + fc^,^' + fc^^^ is con- 
served for every vertex j, because transformation process 
does not create new arcs between vertices which are not 
neighbors already; (ii) Before and after the transforma- 
tion process the following relations hold: g-^"'^ < A:p\ 

Qo < ko and qr > K ■ In this equation the prob- 
abilities of transition have the similar meaning as in the 
equation (O 

r(fc',^,q'|fc,^,qr) = T{k[\h)T{k'^\ko) 

T{q'M.,)r{qM, (8) 

while 

r(fe',^,q'|fc,^,g) = pT{k[\h)T{k',%-l) 

T{q[\q,-l)r{q'M,), (9) 

and 

r(q',^,fe'|fc,^,q) = pT{k',\qo)T{k[\q,-l) 

T{q',\k,~l)r{q[\ki). (10) 

It is important to notice that in the equation ([8]) we do 
not need to worry about the arc which connects neighbor- 
ing vertices because it is, process invariant, bidirectional 
arc. Parameter p in equations Q and PH)) represents 
the probability of the transformation of unidirectional 
arc which connects neighbors to bidirectional arc. In- 
dividual probabilities of transition in equations ([S]), ([HI), 
and (jlOp are again binomial 

Similar equations are easy to write for the transfor- 
mation process of one vertex statistics. More precisely, 
equation 

(11) 

describes their probability of transformation of joint 
probability of one vertex degrees described with P{k) 
into the joint probability of one vertex degrees P'{k'). 

In all the aforementioned equations for the transforma- 
tion process we used the joint probability statistics, be- 
cause the total number of arcs over which this statistics 
is obtained does not change with the process. However to 
calculate the correlations existing in the network it is con- 
venient to use the conditional probabilities with respect 
to the type of the arc which connects two neighboring 
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statistics 
H,v £ {i,o,r} 


{k,,qA -^) 
{k,\ 


statistics 
tJ.,v £ {i,o,r} 


{k^,ki/\ 
(fc^g^l ^) 
{K\ 


1-vertex 
tJ.,v £ {i,o,r} 





Table I: Table of studied statistical moments and correlations. 
In the first column is designated if the averaging is performed 
over vertices, unidirectional or bidirectional arcs. The second 
column describes interesting product moments. 

vertices. The usual equation for conditional probability 
can be employed as 

P'ik',q'\^) = P'ik',^,q')/P'{-.). (12) 

The probability that two neighboring vertices are con- 
nected with unidirectional arc is P'(— >) = P(— >)(1 — p) 
and the probability that an unidirectional arc exists be- 
fore the transformation process is P{—>-) = /L, where 
is the number of unidirectional arcs before the trans- 
formation process. Similar equation holds for conditional 
probability that two vertices are connected via bidirec- 
tional arc: 

V'{k',q'\^) = P'{k',^,q')/P'{^), (13) 

where P'(^) = P(^) + pP{-^) and P(^) = i""/L. 
The is the number of bidirectional arcs before the 
transformation process. In the literature on complex 
networks Q it is usual to use statistics of average de- 
gree of neighbors of a given vertex to represent the cor- 
relations of degrees in the network. Such a measure is 
usually represented with figures of the average neighbor- 
ing degree dependence on the degree of the monitored 
vertex. It is easy to verify if the network is correlated 
or not by simple inspection of such a figure. In order to 
analytically describe degree-degree correlations resulting 
from this process wc will use a different measure much 
more common in usual statistical analysis. The observed 
and calculated correlations are just the nonccntralized 
product-moments of independent variables. In the fol- 
lowing we will loosely use the term correlations for all 
of the calculated statistical quantities both for statistics 
obtained on one vertex via Eq. (jlip or for statistics of 
degrees on connected pairs of vertices calculated via Eq. 
(HI) and (O. All the possible types of correlations we 
studied are summarized in Table [H 
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Figure 1: This figure shows an excellent agreement be- 
tween simulations of transformation process and our analyt- 
ical treatment. On x-coordinate is parameter p which repre- 
sents the probability of arc transformation. The y-coordinate 
represents the numerical value of given correlations. Simula- 
tions are designated by markers and analytical results with 
lines. The initial network is a Barabasi- Albert directed net- 
work of 10^ vertices. The simulations are averaged over 1000 
realizations. 

The equation for calculation of 1-vertex statistics is: 

{k[K)^Y.^'XP'^^')^ (14) 

k' 

for the case of in-out degree correlations. All other 1- 
vertex degree correlations are calculated in a similar way. 
There are two different equations with which we calcu- 
late 2-vertex degree correlations. The first one is for the 
calculation of 2-vertex degree correlations connected via 
unidirectional arcs. This type of correlations are desig- 
nated as ('I — >) in order to distinguish them from the 
2-vertex degree correlations calculated via bidirectional 
arcs ('I ^). The equation for the in-out degree correla- 
tions of the vertices connected via unidirectional arc is: 

{kWo\^) = Y.^Wo'P'{k',q'\^). (15) 

k' q' 

Using equations ^ and we can calculate degree cor- 
relations of unidirectionally connected pairs of vertices. 

The calculation of degree correlations of bidirectionally 
connected pairs of vertices is a bit trickier. The approx- 
imate equation for in-out degree correlations in this case 
is: 

2(fcX|^):^^fcXP'(fc',9'|H- (16) 

k q 

This equation is just an approximation because in order 
to analytically calculate expected correlations after the 
transformation process we have to sum over all degrees 
k' and q' thus including every bidirectionally connected 
pair two times, except for the pairs which have exactly the 
same degrees. The equation could be improved by tak- 
ing into account a new class of correlations just between 



Figure 2: The evolution of 2-vertex correlations, measured 
on pairs connected with unidirectional arcs, with change of 
parameter p. The initial network is a unidirectional version 
of Barabasi- Albert model of size lO'^ with average out degree 
equal to one. We averaged over 100 different realizations of 
the process on 10 different realizations of directed B-A model. 
The measured correlations are shown in Figure as markers. 
Analytical results are presented as full lines. Agreement be- 
tween simulation and analytics is excellent. 
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Figure 3: The change of some degree correlations of bidirec- 
tionally connected vertex pairs with parameter p. The initial 
network is the network of Spanish Wikipedia. We simulated 
100 different realizations of transformation process. There is 
a good agreement between the simulations (markers) and an- 
alytical results (full lines). Because analytical equation for 
these cases is approximate, the standard deviation margins 
are also plotted in the figure. The equation which describes 
the first type of correlations is Eq. (|21|l . The equation for 
second type of correlations is easily obtained with a change 
of indices. The equation for the third type of correlations is 
not presented in this paper, but can be obtained using the 
presented formalism. 



the bidirectionally connected pairs of vertices which have 
the same degrees, but as it will be shown later, this ap- 
proximation is more than good enough for estimating 
expected correlations for most of large enough networks. 
Using equations ([7]) and it is possible to calculate 
degree correlations of bidirectionally connected pairs of 
vertices. 
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If the calculated correlations are less or greater than 
the expected in the random network of the same degree 
distribution (configuration model) [H, [11] , then the net- 
work exhibits a structural tendency that the vertices of 
larger degrees are mutually connected less or more fre- 
quently. For example the expected in-out degree corre- 
lations of unidircctionally connected pairs of vertices is: 



k q 



(17) 



In Eq. (|T7|) we use the conditional probabilities that the 
two neighboring vertices are connected via unidirectional 
arc. The P'{k'\ — >) designates that the vertex of degrees 
k is the starting vertex of the conditioned arc, while 
'P'Iq'I ^) designates that the vertex with degrees q' is 
the end vertex of that arc. In the case of vertex statistics 
it can be written as 



(fco) 



(18) 



because of the fact that the vertex certainly has outgoing 
arc which connects it to a neighboring vertex. 

It is important to understand the fine difference be- 
tween the correlations between degrees of neighboring 



vertices and correlations of degrees of one vertex. In the 
following the correlations of degrees of neighboring ver- 
tices will be designated with the conditional type of arc, 
to make the distinction from one vertex correlations. The 
studied types of correlations are shown in TableU All the 
correlations are calculated analytically and checked with 
numerical simulations on networks of different sizes, den- 
sity of arcs and starting correlation structure. In Figures 
[l] [2] and [3] are shown some 1-vertex, 2- vertex unidirec- 
tional and 2-vertex bidirectional correlations calculated 
analytically and compared to simulations. 

For example the in - in degree correlations of neigh- 
boring vertices can be calculated using the expression 

{Kq',\-^) = Y.Kq'iP'ik',q'\-.), (19) 

k' q' 

and using Eq. ^ and (fT^ the final solution is 

{k'd\ ^) = (1 -p)2(fc,g,| ^)+pil~p){h\ (20) 

The other example are in - in degree correlations 
of bidirectioanlly connected vertices calculated with the 
presented scheme: 



J 



(Kill 



jl-p)^ {{hq,\ ^)P{^) +pP{^) {{hq,\ ^) - (fc,| ^))) 
P{^)+pP{^) 



(21) 



r 



In this case the factors P{^) and P{^) are also present 
because they did not cancel out as they did in Eq. ((20)) . 

To compute (|T^ and other possible correlations the 
following set of relations is useful: 




?i(l -p) 



((1 -p)n+p) 



n(l — p) 
1 

n(l 



p)+p 



1 

pn 



(22) 



The computation of all elementary correlations shown 
in Tabic [J can be more elegantly described with matri- 
ces of transformation T. If the observed correlations are 
represented as components of a "correlation vector" the 



studied process can be described with two different trans- 
formation matrices - one which transforms vector of one 
vertex correlations Ti„ and the second which transforms 
the vectors of the neighboring pairs correlations ■ 



III. TRANSFORMATION MATRIX 

Two transformation matrices differ one from another. 
The matrix of one-vertex correlations is the square ma- 
trix of rank 8. The complete one- vertex statistics of inter- 
est can be written as a vector S with components: S"^ = 
{{ki) — (ko) , (kr) , (kf), {kg), (k^) , {kiko), (kikr), (fco/cr)}. 
The expected correlations calculated after the transfor- 
mation process for one-vertex correlations can now be 
written as a simple linear equation 



(S'(p)) -Ti„(p)S(0), 



(23) 



where S'{p) represents vector of correlations after frac- 
tion p of unidirectional arcs is transformed into bidirec- 
tional arcs. Average in-degree is always equal to average 
out-degree and is therefore eliminated from the matrix. 
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The transformation matrix for 1-vcrtcx correlations Ti^, 
is: 



/ (i-p) 

/ 2p 





10 

p(l-p) {1-pf 

p(l-p) (1-p)^ 

2p(l-p) p^ P^ 1 



-p(l-p) p(l-p) 




-p(l-p) 








2p 

(1-p)^ 
p(l-p) (1-p) 








2p^ 



p(l-p) p(l-p) 








2p 


(1-p)/ 



This matrix also has its inverse: 



(24) 
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(1-p) 



(25) 

The inverse matrix can be of interest for statistical anal- 
ysis of real networks. If there is a reason to believe that 
the bidirectional arcs are completely random consequence 
of the mentioned transformation process and if one has a 
network model which does not take into account the bidi- 
rectional arcs - it can be tested using the inverse trans- 
formation matrix. It is easy to calculate the parameter p 



J 



as p = , where L*"* represents the number of bidirec- 
tional arcs in the network of interest, while L is the total 
number of arcs. Then using equation 



(26) 



one can find the vector of expected degree correlations be- 
fore the transformation process. Comparing then (S(0)) 
with the vector of correlations obtained by the model 
one can gain additional information on the structural 
role of bidirectional arcs and/or quality of the studied 
model. Such assumptions could be a good null model 
for a number of real world applications such as analysis 
of communication or traffic networks. In the companion 
paper (20| we will present application of this framework 
to the Wikipedia networks as a case study. 

The transformation of 2- vertex correlations is given by 
the expression 



(S'2,„(p))=T2„(p)S2„(0)-fb(p), 



(27) 



where 821, presents vector of 2-vertex product moments 
and b(p) additional vector containig terms like given 
in Eq. ^E^. The matrix of 2-vertex degree correlations 
is to big to be presented. For example, the equation for 
expected correlation of two bidirectional degrees of nodes 
connected via unidirectional arc is: 



{kWr\ 



>) = {kr,qr\ - 

+ {{hqi 



-P{{<1t\ -^) + (fcrl -^)) +P{{kiqr\ 

+ {hqo\ -^) + {koqi\ -^) + {koqo\ 



.)-(fc.|^)-(fc„|^) 



>) + {krq,\ ^)) 



Wo I 



1)(28) 



Nevertheless, there is enough information for interested 
reader to be able to reconstruct the 2-vertex transforma- 
tion matrix completely. 

It is important to note that correlations and 
{ko\ of the exit vertex are very different from the 
correlations obtained with 1-vertex statistics. It can be 
written using usual one-vertex statistics as: (fc^l — >) = 

^^0^°^ , while (fcol —!■) = Similarly, the correlations 

of in- vertex written by means of one- vertex statistics are 
= while (,„|^) = i|0). 

It can be shown that the correlations arising from 
transformation process are different from those that we 
would expect from the non-correlated network. The com- 
parison between real correlations in the network and the 
ones expected from the configuration model is shown in 
Figure ID 

Up to now we have shown that degree correlations can 
be strongly influenced by addition of bidirectional arcs. If 
the initial network is already very correlated transforma- 
tion process tends to amplify these correlations compared 



to configuration model. An example of such strongly cor- 
related networks is the Barabasi- Albert directed network 
that we used for comparison [T9| . In this model new ver- 
tices are attached to the old ones proportionally to the 
sum of in-degree of the old vertices and some parameter 
a. In our case the parameter a = 1 was chosen in the 
simulations and the starting network for BA evolution 
was Gilbert network of 10^ vertices connected with prob- 
ability 0.01. Other values of parameter a were tested as 
well. It is known that the properties of the directed BA 
network do not depend on size or degree sequence of ini- 
tial network in the thermodynamical limit. The final size 
of the simulated networks was 10^. 

A very important assumption of this analysis is that 
the initial network does not mutate/evolve in any other 
way during the transformation process. However, reality 
is that many complex networks evolve during the course 
of time and are highly nonequilibrium systems Q . There 
is a myriad of different rules one can think of in order to 
simulate some aspect of network growth and to study 
the influence of the addition of the bidirectional arcs in 
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Figure 4: The correlations {k'^q'o \ —*) resulting from the trans- 
formation process compared with the expected correlations 
of configuration model {k'^qol —*)Rand = ihd^Al^iJiA ^ jjj this 
case the transformation process clearly amplifies observed de- 
gree correlations. 

all these cases is impossible. We decided to study how 
addition of reciprocal arcs changes some very well stud- 
ied growth process. An obvious candidate for the study 
was preferential attachment growth with the addition of 
reciprocal arcs during the evolution process. 



IV. GROWTH MODEL 

In this model the crucial idea is that at formation of di- 
rected arc between new and old vertex there is a transfer 
of information about that event from the pointing ver- 
tex to the pointed vertex. In that case the old vertex 
can return the newly formed arc to the new vertex, thus 
forming a bidirectional arc. In the model this process of 
information return will be modeled by the probability r 
that the old vertex points back to the new vertex. 

The model can be described as a variant of the directed 
network growth by means of preferential attachment and 
formation of reciprocal arcs. More precisely: in every 
time step t there are t vertices labeled from 0, ...,t — 1 
present in the network, and a new vertex, labeled with t 
attaches to the network with m outgoing arcs. Every of 
those TO arcs is attached to some already present vertex s 
with probability proportional to the in-degree of the old 
vertex i.e. P{t ^ s) ~ ^'k^l • ^^'^ network is grown only 
using this rule, the model is a variant of the BA model 
for the growth of directed network with the atractivness 
parameter a ~ 0. The additional rule is that every of m 
new arcs, with probability r, can rccieve a reciprocal arc 
from the old vertex. With this additional rule the model 
is completely described. It is useful to note that based on 
previous work [13, one can expect that for the value of 
parameter r = the network will have in-degree distri- 
bution with exponent 7 = — 2, and for value of parameter 
r = 1 the network will have in-degree distribution with 



exponent 7 = — 3. 

Although we will later calculate the analytical expres- 
sion for the joint degree probability distribution for gen- 
eral parameter to, we will first present the solution for 
771 = 1 because it is easier to write it in a closed form. We 
will use master equation to calculate joint degree proba- 
bility distribution. Let p{ki, ko, s, t) present the probabil- 
ity that vertex introduced to the network at the moment 
s, at time t possesses in-degree ki and out-degree ko- 
In this treatment, for simplicity we will not use bidirec- 
tional degree because it would unnecessarily complicate 
the calculation. As the initial condition at time t = 1 we 
choose the network of two reciprocally connected vertices 
s = i s = 1. 

p{ki,ko,0,l) = 5k,,i5k„,i, 

p{h,ko,lA) = Sk„iSk^,i. (29) 

The probability p(/ei, /Co, s, for ko > ^, ki > 0, and 
s < t is 

fc — 1 

p{h,ko,s,t) = ' {1 - r)p{ki - l,ko,s,t - 1) + 

fc — 1 

' , . rp{ki - 1, fco - 1, s, i - 1) + 

(^l-j^^p{k,,ko,s,t-l). (30) 

The function Linit) is a random variable which is equal 
to the sum of all degrees present in the network at time 

t-i 

s=0 

= ^h^p{ki,ko,s,t - I). (31) 

ki s^ko 

The following approximation for the Lin(t) is very rea- 
sonable for a very big network 

L,„(0 ^ (i»„(t)) = (l + r)t, (32) 

i.e. we assume that the random variable Li„{t) is well 
described by its expected value. 

The equation for the vertex t which is just attaching 
to the network at the time t is 

p{ki,ko,t,t) = r5ki,i5k„^ + (1 -r)5k,fi5k^s- (33) 

Probability that the vertex s does not have any ingoing 
arc is 

p(0,fco,s,i) = (l-r)4„,i- (34) 

We sum the obtained joint probabilities p{ki,ko, s,t) 
that the vertex s at time t has degrees ki and kg, over 
all present vertices s to get the probability P{ki, ko,t), 
that the randomly chosen vertex at time t has degrees ki 
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5 




ki 8 



Figure 5: The event-set lattice used to calculate the stable 
distribution. The borders of the lattice segment in which 
paths contribute to the probability are designated with blue 
color. Red color designates one of the possible paths and 
arrows represent the possible directions of paths. 



and ko i.e. P{ki,ko,t)^J2s=oPi^'-^^o,s,t)/{t + l). We 
also assume that the distribution will be stable for large 

t i.e. P{ki,ko,t)*^^ P{ki,ko)- The described procedure 
results with the equations 



P(0, ko) - (1 - r)4„ 



(35) 



Pih > 1,1) = 

and 



r(l + r)4.,i + ih - 1)(1 - r)Pih - 1, 1) 



1 + r + ki 



(36) 



Pih > l,fco > 1) 



fc ,; - 1 



1 + r + k 

+rP{h^l,ko-l)] 



[(l-r)P(fc, -l,fco) 
(37) 



The Eq. ^ shows that P(l, 1) = r(l + r)/(2 + r). 
The simplest way to solve this set of equations is to sum 
contributions of all possible paths for probability distri- 
bution P{ki,ko). Equations ([M)) and ([57]) can be easily 
represented as the walk on the event lattice shown in 
Figure [5] The nodes of this lattice represent all the pos- 
sible events (degree combinations) of randomly choosing 



a vertex from the ensemble of networks generated by the 
studied process. Every movement to the right from the 
site {fci — 1, ko} to the site {ki, ko} is multiplying the prob- 
ability distribution attached to the site with the factor 
w+fi = (ki — 1)(1 — r)/(l + r + ki), while every diagonal 
movement from the site { fc^ — 1 , /cq — 1 } to the site { fc^ , fco } 
represents multiplying the probability distribution with 
the factor w+^+ = {ki — l)r/(l + r + ki). The value 
of the joint degree probability distribution P{ki,ko) is 
therefore equal to sum of the contributions of all possible 
paths from site {1,1} to the site {ki,ko}. Every path 
has ki — ko movements to the right and kg — 1 diagonal 



Theory 

□ felO' 

feio' 




Figure 6; In this figure it can be seen the agreement be- 
tween distribution of in-degrees of the process simulated over 
100 different realizations (markers) and the analytical solu- 
tion (full line). The parameter r has the value 0.2. 



movements and every of them has the same contribution 



n=2 



The number of distinct paths is (^'Z^) which is equal to 
the number of combinations of factors lo+.o and . 
The general expression for the joint degree distribution 
is therefore equal to 



n-1 
+ r + n 



(38) 



^ik.,ko) = eih - ko) - - 

\ko-lJ 2 + r {r + 3)ki-i 



(39) 



where denominator in the last factor (r + 3)/£.-i repre- 
sents Pochammer symbol, defined with relation {x)n ~ 
x{x+l)...{x+n — l). Nice property of this solution is that 



the correlations between degrees of one vertex are exactly 
computed and easily checked. In the limit of the big in- 
degree, using the representation of Pochammer symbol 
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by means of Gamma functions {x)n = ^r(^"'' i it is easy 
to show that the asymptotic behavior of the last factor 
IS lmife,^c -^^ - T-/^ I Q^^, ^ 



The exponent of the power-law-Hke tail is the prop- 
erty which can be easily checked for sufficiently large 
networks. We can sum over all the values of ko in the 
equations psp . p6p . and p7p . and use the approxima- 
tion of continuum to get the equation for the marginal 
in-degrec distribution. 

(1 + r)P{k,) = {k, - l)P{h - 1) - hP{h). (40) 

In the continuum approximation this equation can be 
written as: 



{l + r)P{k,) = - 
and has a solution 

P{h) ' 



d{hP{h)) 
dkj 



-(2+r) 



(41) 



(42) 



To check analytical solutions we have performed a sc- 
ries of simulations for different values of parameter r and 
different network sizes. For all monitored parameters and 
sizes of network we found a nice agreement between an- 
alytical solution and simulations (Fig. [6]). 



GROWTH MODEL FOR A GENERAL 
PARAMETER m 



It is possible to calculate joint degree distribution of 
the model for the general parameter m. We again use 
master equation: 



= 1(7) (l3^)'0-|:^)"'"'e(^-.-O 

■ Q ^"(1 - -l,ko-n,s,t- l)Q{ko - m - n), 

ri=0 ^ ^ 

(43) 



where Q{x) represents usual Heaviside Theta function 
with convention 0(0) = 1. Indices m, I and n combine 
all the possible combinations of: number of outgoing arcs 
formed on the new vertex, number of new arcs attached 
to the old vertex and the number of formed reciprocal 
arcs from which it is possible to create vertex with given 
degrees. To ease the calculation, we allowed the forma- 
tion of multiple arcs between two vertices, which in the 
thermodynamical limit does not influence the exact so- 
lution. The boundary condition of this set of equations 
is 

p(fc,,fc„,t,t) =^ n)^"(l-0'"""'^fc..n4„,rn. (44) 
n=0 ^ ^ 

I 



We again sum over all vertices and approximate the func- 
tion Lin[t) with its expected value Lin(t) ~ {Lin(t)) = 
(1 + r)mt. Using equation ([44)1 and assuming stable de- 
gree distribution in the thermodynamical limit the fol- 
lowing equation for the joint degrees distribution is ob- 
tained 



1 + r + ki 



h - 1 



'—-^P{k,,ko) = V r"(l-r)™-"(5,,.„4„,™ + -^e(fc,-l)e(A:o-m-l)rP(fc,-l,fc„~l) 
I + r \ n / 1 + r 



+ tl le(fc, - l)e(fc„ - to)(1 - r)P{h - 1, ko). 

I + r 



(45) 
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This equation can be solved in a manner similar to equa- 
tions ([35]), ([36l) . and (|37l) . In Figure [7] we present the 
event-set lattice with the contributions of the paths to 
the joint probability attached to every site. The contri- 
bution of every path is again identical for every lattice 
bond as soon as the path detaches from the line ko = m. 
The total contribution of the paths differ only in the num- 
ber of steps made on line ko = m for the 1 < ki < m. 
For 1 < fci < m and ko ~ m the joint degree probability 



P{ki,m) = 



1 + r 



ki-l 



ki-l 



ki 



l + r + h 
1 + r 

+-; r 

l + r + k 



1=1 



3=1 



+ r + j 



(46) 



Figure 7: The lattice used to calculate the stable distribution 
of the model for the parameter m = 4. Blue color designates 
the borders of the lattice area within which the paths con- 
tribute to the joint probability P{ki — 7,ko — 8), while the 
red path represents one of the possible contributing paths. 
Arrows represent the allowed directions of movement on the 
lattice, while the loops on the sites represent the additional 
coefficients ai contributing to joint degree probability on these 
sites. 



and a; represents the probability of binomial distribution 

o-i ~ (T)^'(-'^ ~ r)™"'. The equation for the case ki > m 
and ko — m is 



ki 



P(fc„m) = (l-r)^'-™P(m,m) [] - 



.7-1 



j—m-\-l 



r+j 



(47) 



The contribution of paths which separated from the line 
ko = m i I < ki < m at the site (ki ^ k[,ko = m) is: 



P{ki,ko)^(k'.,m) = P{k'i,m) 



ko TTL 

ki - k' - 1 



(l-r)^-° 



-k'- — fco+m 



(fc, -l)!(r + 3)fc;_i 
(fc^-l)!(r + 3),._i 



r 



(48) 



where 7r(fc^, m) represents the sum over the possible paths 
after detachement from the line ko = m. The whole solu- 
tion is now easy to write using equations (|46)) . (|47l) . and 
PS)) . but it is complicated and not very informative. Nev- 
ertheless, it is interesting to monitor the behavior of Eq. 
(PS)) . It can be verified that for P{ki < m,ko) the joint 
degree distribution can increase depending on the initial 
parameters of the model. On the other hand, in the limit 
ki ^ m we expect the fall of the joint probability distri- 
bution. This implies that for certain range of parameters 
the distribution has a nontrivial mode. Such a behav- 
ior can be easily checked if the equation (j45p is summed 
over all out-degrees ko £ [to, go), to obtain the marginal 
distribution of in-degree. In the range ki £ [1,to] the 
solution is 



ki-l 



P{h) = 



i=i 



ki-l 

TT — - 



ih - 1)! 

{l~iy. j^^ l + r + j 

(49)' 

The marginal in-degree distribution obtained analyti- 
cally coincide with the simulations rather well as shown 



on Fig. [HI The modal character of the in-degree distribu- 
tion is easily observed in this equation. The dependence 
of mode on the parameters r and to is shown in Figures 
[land [TO] 

The other important property of the distribution is 
its power law behavior in the tail. As can be seen in 
FigurefTOlthe exponent of the tail does not depend on the 
parameter m. Power law behavior of the tail is governed 
only by the parameter r as shown in Figure [H] Indeed 
in the continuum approximation, valid for fc^ ^ 1, the 
equation for the in-degree marginal distribution is 



(l + r)P(fc,) 



d{hP{h)) 



dki 



The solution of this equation is 



P{ki) 



-(2+r) 



(50) 



(51) 



and dependence of the value of the power law exponent 
with respect to parameter r is very clear. The equation 
for the power law exponent 7 = — 2 — r also confirms 
our claim that the model behavior interpolates between 
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N=ro° 

— Theory 



Figure 8: The marginal distribution of the in-degree calcu- 
lated from theory (full line) and simulations (markers), for 
3 different sizes of final networks. The averaging was per- 
formed over 100 simulation realizations, and parameters used 
are r = 0.15 and m = 18. There is an excellent agreement 
between theory and simulations. 




Figure 9: The marginal distribution of the in-degree calcu- 
lated from the theory for m = 10 and different values of pa- 
rameter r. The tail of distribution clearly depends on the 
the parameter r and the power law character of the tail is 
easily checked. The shape and the position of the mode also 
depends on the value of r. 



usual BA model with the exponent —3 and the directed 
BA model with the exponent —2. 

It is important to mention that this analytical discus- 
sion is valid up to certain point also for a little bit broader 
class of growth models. In the analytic treatment the dis- 
tribution of the outgoing arcs of the new vertex t (|^^ at 
the time f is a delta function. We can expect that this 
reasoning can be applied for a more general class of dis- 
tributions for the outgoing arcs of the vertex t at time t 
with the assumption that the mean field approximation 
is valid. In particular, we expect that this consideration 




Figure 10: The marginal distribution of the in-degree cal- 
culated from the theory for r = 0.4 and different values of 
parameter m. The existence and the position of the mode 
strongly depend on the value of parameter m. The tail of dis- 
tribution is independent of the parameter m and the power 
law character of the tail is easily checked. 



— 5 - Razdioba 

- - Binomna Razdioba 

' - Poissonova Razdioba: 
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Figure 11; This figure shows that there is no important dif- 
ference between behavior of the in-degree distribution for the 
different choices of the vertex t out-degree distribution at time 
t. The simulations are presented for the parameters m = 10, 
r — 0.2, N = 10® and 100 realizations. The presented distri- 
butions are cumulative in-degree distributions which are more 
comparable in the case of simulated models. The broader dis- 
tributions of initial out-degree show a bit smaller maximal 
degree, which we explain by stronger competition for the new 
vertices. 



will be valid for all unimodal discrete distributions with 
fast decaying tails. To test this assumption we examined 
cases in which the out-degree of the vertex t at time t is 
drawn from the Binomial and Poisson distribution. 
The Poisson distribution 



P{ko\m) 



(52) 
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is determined only by parameter m. For every monitored 
m of the original model, we made a new set of simulations 
with Poisson distribution with the same m. For Binomial 
distribution 

the case is a little bit more complicated because it is 
defined with two parameters: m - the expected number 
of outgoing arcs and Z maximal allowed out-degree of 
the vertex t at time t. For a broad choice of values of 
parameter Z, the results were very similar to the ones 
expected from the original model as 3> ^ as can be 
seen in Fig. [11] 

VI. CONCLUSION 

We have shown that reciprocal arcs can significantly 
influence the degree correlations in complex networks. 
In the first part of the paper we laid down a way to in- 
vestigate influence of randomly distributed bidirectional 
arcs on the overall degree correlations and have shown 
how this hypothesis can be tested in the case of real net- 
works. We also studied a simple model of the network 
growth which conserves expected fraction of reciprocal 
arcs. 

The analysis laid out in the first part of this paper 
focuses on the degree correlations represented as aver- 
age product moments. This analysis has its positive and 
negative side. The average product moments clearly do 
not contain as much information as the average neighbor 
degree functions Q and this is an obvious shortcoming 
of such a measure. On the other hand it is exactly the 
reason why product moments can be very useful for case 
studies. In the case of very correlated networks, some- 
times the frequency of degree statistics for the large de- 
grees is so scarce that it effectively shrinks the available 
configuration space for the null models which are trying 
to preserve correlations found in the network. This re- 



duction of available configuration space can sometimes be 
so huge that for connected pairs of vertices with large de- 
grees any result different from the already observed in the 
network is almost impossible to realize. In this case the 
product moments incorporate in themselves much larger 
number of viable network realizations so that the analy- 
sis of network with correlated null models is much better 
founded. 

In the companion paper [2^ , we apply the theory pre- 
sented in this paper to show that Wikipedia networks 
cannot be explained by random distribution of bidirec- 
tional arcs on the static network. In the same paper we 
used the presented growth model to explain in-degree 
distribution of the Wikipedia networks with very good 
results. 

From the theoretical point of view this model helps 
to understand possible mechanisms which create modes 
in the degree distributions of different scale-free directed 
networks. Furthermore this model is a good candidate to 
explain other empirical directed networks which combine 
power-law tails and nontrivial mode of the degree dis- 
tribution and the future work in this direction is clearly 
needed. It also represents one of the simplest growth 
models which preserves some type of local correlations. 
It is our opinion that the understanding of interrelations 
between different types of correlations in complex net- 
works heavily depends on such growth models. The val- 
idation of this claim is an important task for our future 
research. 
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